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1 Introduction 

Lovasz and Schrijver (1991) have proposed a very intriguing successive convex relaxation pro- 
cedure for 0-1 integer programming problems. The procedure called iV+, to be defined shortly, 
when applied to a classical linear programming (LP) relaxation of the stable set problem (with 
only the edge and nonnegativity constraints) produces a relaxation for which many well-known 
inequalities are valid, including the odd hole, odd antihole, odd wheel, clique, and even the 
orthonormal representation inequalities of Grotschel, Lovasz and Schrijver (1981). This implies 
that for many classes of graphs, including perfect (for which clique inequalities are sufficient) 
or t-perfect graphs (for which odd hole inequalities are sufficient), one can find the maximum 
stable set by using the N + procedure. 

The iV_|_ procedure is a strengthening of another procedure, called N, also introduced by 
Lovasz and Schrijver. The main difference between the two procedures is that N + involves 
a positive semidefinite constraint. When applied to a linear programming relaxation, N will 
produce another (stronger) LP relaxation while N + will produce a semidefinite relaxation. For 
the stable set problem, Lovasz and Schrijver have shown that the relaxation produced by N is 
much weaker than the one derived from N+. 

In general, it is however not clear in which situations the procedure iV+ is better or signif- 
icantly better than N; especially, when N and N + are applied iteratively. In this paper, we 
try to shed some light on this question. We generalize certain properties derived by Lovasz 
and Schrijver. We also identify certain situations in which N produces the same relaxation as 
N+. Several examples are discussed throughout the paper, including one in which the number 
of iterations of the JV+ procedure needed to derive the convex hull of 0-1 points is equal to the 
dimension of the space, hence resolving a question left open by Lovasz and Schrijver. 

In the next section, we review the lift-and-project procedures and their basic properties. 
Section || includes upper bounds on the number of major iterations required by such procedures. 
Section |4] discusses techniques to prove lower bounds on the number of major iterations required. 
Sections |5| and ^ include geometric properties and characterizations of the convex relaxations 
produced by the procedures. 

2 Lovasz- Schrijver procedures N and N + 

First, we describe two lift-and-project procedures proposed by Lovasz and Schrijver (1991) which 
produce tighter and tighter relaxations of the convex hull of 0-1 points in a convex set. In what 
follows, ej is the jth unit vector and e is the vector of all ones. The sizes of e and tj will be 
clear from the context. The cone generated by all 0-1 vectors x £ M. d+1 with xq = 1 is called Q. 
Let K C Q denote a convex cone; for example, K could be a polyhedral cone obtained from a 
polytope P in [0, l] d via homogenization using a new variable x$. That is, if 

P = {x£R d : Ax < b, < x < e}, 
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then 



K :- { ( ^° ) € : ,l.e < .r () b. < .r < .r„< 



We are interested in determining (or approximating) Kj, the cone generated by all 0-1 vectors 
of K. 

Let K*, Q* denote the dual cones of K and Q under the standard Euclidean inner-product, 
e.g., 

K* := {s G R d+1 : x T s > 0, Vi £ K}. 

S d+1 denotes the space of (d + 1) x (d + 1) symmetric matrices and denotes the cone of 

(d + 1) x (d + 1) symmetric, positive semidefinite matrices. For a matrix A G S d+1 , we denote 
its positive semidefiniteness by A y 0. When we deal with the duals of convex cones in the 
space of (d + 1) x (<i + 1) matrices (or in the subspace of the symmetric matrices), we always 
take the underlying inner-product to be the trace inner-product (or Frobenius inner-product): 
(A, B) := Tr{A T B). 

Let diag : S d+l — > M. d+1 denote the linear operator which maps a symmetric matrix to its 
diagonal. Then its adjoint diag* : IR rf+1 — > is the linear operator Diag(-) which maps a 

vector from M. d+1 to the diagonal 

atrix in S d+l whose (i, i)th component is the ith component of the original vector. 



Definition 2.1 (Lovasz and Schrijver (1991)) A (d + 1) x (d + 1) symmetric matrix, Y, 
with real entries is in M(K) if 

(i) Yeo = diagiY), and 

(ii) u T Yv > 0, VnG Q* , v G K*. 



Lovasz and Schrijver note that condition (ii) of the above definition is equivalent to YQ* C K 
(where YQ* = {Yx : x G Q*}), or: (ii)' Ya G K for all i G {1, . . . ,d} and Y(e - a) G K 
for all i G {1, . . . , d}, since the extreme rays (after normalization) of the cone Q* are given by 
ext(Q*) = {ei, e 2 , . . . , e d , (e - ei), (e - e 2 ), . . . , (e - e d )}. 



Definition 2.2 (Lovasz and Schrijver (1991)) Y G M+(if) z/Y G M(K) andY is positive 
semidefinite. 



Observe that if we take any x G K (not necessarily integral) and consider Y = xx , 1" satisfies 
Y >z and also (ii)', but this specific 1" satisfies (i) if and only if x is such that Xi(xo — Xi) = 
for all i, i.e. x corresponds to a 0-1 vector. 
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Now, we define the projections of these liftings M and M + : 

N(K) := {diag(Y) : Y G M{K)}, 

N+(K) := (diag(F) : Y G M+(K)}. 

The above argument regarding xx T shows that Kj C N+(K) C N(K) C X, the last inclusion 
following from the fact that Y(eo - e;) G K and Yei G if imply that x = Yeo G i^. 

If P is a polytope (or any convex set) in [0, l] d then we simply write N + (P) to represent 
■|x : ( ^ ^ G A r +(i ; T)|' where -RT is the cone obtained via homogenization using the variable xq, 
and similarly for N(P). We also let M(P) = M(K) and Af+(P) = M + (K). 

We should point out that the definition of M (or M+) is such that M{K) depends only on 
the sets K D {x : X{ = xq} and K (~) {x : Xi = 0} for all i. In particular, we have: 

Lemma 2.1 Lei K and K' be such that K Pi {x : X{ = xq} = K' n {x : Xi = xq} and K f~) {x : 
Xi = 0} = K' n {x : Xi = 0} /or all % G {1, . . . , d}. Then M{K) = M(K') (and N(K) = N(K') ) 
and M + {K) = M + (K') (and N + (K) = N + {K') ). 

For example, P = {x G M. 2 : \\x — 0.5e| I2 < \} and P' = {x G M? : \\x — 0.5e||i < 0.5} (see Figure 
0) have the same N(P) = N(P'). 




Figure 1: Two convex sets with the same N + (-). 

The definitions of M, N, M + and N + are invariant under various operations including 
flipping coordinates xi — > (1 — x.{) for any subset of the indices {1,2, ... ,d}. More formally, 

Proposition 2.2 (Lovasz and Schrijver (1991)) Let A be a linear transformation mapping 
Q onto itself. Then 

N(AK) = AN(K) and N + (AK) = AN + (K). 
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One crucial feature of the operators N and N+ is that they can be iterated. The iterated 
operators N r (K) and N r + {K) are defined as follows. N°(K) := K, N%(K) := K, N r (K) := 
N(N r ~ 1 (K)) and N r + (K) := N+(N+~ 1 (K)) for all integers r > 1. Lovasz and Schrijver (1991) 
show that, even without the positive semidefiniteness constraints, d iterations are sufficient to 
get Kj: 

Theorem 2.3 (Lovasz and Schrijver (1991)) 

K D N(K) D N 2 (K) D...D N d (K) = Kj 

and 

K D N + (K) D Nl(K) D...D Nf(K) = Kj. 



Let a T x < axQ be a valid inequality for Ki. Then the smallest nonnegative integer r such 
that a T x < axQ is valid for N r (K) is called the N-rank of a T x < axo relative to K . The 
iV + -rank of a T x < axo relative to K is defined similarly. The above theorem states that these 
ranks are at most d for any valid inequality. The iV-rank (resp. iV + -rank) of a cone K is the 
smallest nonnegative integer r such that N r {K) = Kj (resp. N+(K) = Kj). 



Theorem |2.3| can also be proved using the results of Balas (1974), see Balas, Ceria and 
Cornuejols (1993). Our interest, in this paper, mostly lies in understanding the strength of N + 
in comparison to N. Consider the stable set polytope on a graph G = (V, E) defined as the 
convex hull of incidence vectors of sets of non-adjacent vertices (known as stable sets). Let 
FRAG be the relaxation defined by the edge constraints (xj + Xj < 1 for all edges G E) 
and the nonnegativity constraints (x{ > for all i E V). Then N(FRAC) is exactly equal 
to the relaxation obtained by adding all odd hole inequalities, saying that Yli^c x i — 2 — 
for any odd cycle C with no chords. However, many more complicated inequalities have small 
iV + -rank. Lovasz and Schrijver (1991) prove that odd hole, odd antihole, odd wheel, clique 
and orthogonal inequalities all have iV_|_-rank at most 1, relative to FRAC. These results are 



proved using Lemma 3.5 of next section, except for the orthogonality constraints. In contrast, 
the TV-rank of a clique inequality for example is equal to p — 2 where p is the size of the clique. 
Note that the separation problem for the class of clique inequalities is NP-hard (and so is the 
problem of optimizing over the clique inequalities, see Grotschel, Lovasz and Schrijver (1981)). 
iV_|_, however, leads to a polynomial-time separation algorithm for a broader class of inequalities. 
This, and more generally the importance of N and iV+, stems from the following result. 



Theorem 2.4 (Lovasz and Schrijver (1991)) If we have a weak separation oracle for K 
then we have a weak separation oracle for N r (K) and N^_(K) for any fixed constant r. 

Together with the equivalence between (weak) optimization and (weak) separation (Grotschel 
et al. (1981)), this implies for example that the stable set problem can be solved in polynomial 
time for any graph with bounded iV_|_-rank (Lovasz and Schrijver (1991)). 

Next we study the upper bounds on N- and iV + -ranks of inequalities and convex sets. 
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3 Upper bounds on the N- and 7V + -rank 

Lovasz and Schrijver give some ways to upper bound the iV-rank of an inequality. They show 
the following. 

Lemma 3.1 (Lovasz and Schrijver (1991)) 

N + (K) C N(K) Q(Kn{x:Xi = 0}) + (K n {x : Xi = x }) , for all i € {1, 2, ... , d}. 

Lovasz and Schrijver (1991) define an operator iVo by: 

N (K)= P| {(K n {x : Xi = 0}) + (K n {x : Xi = x })} . 
i=l,-,d 

Thus, N(K) C Nq(K). The iterated operator iVJ, iVo-rank of inequalities, polytopes and convex 
cones are defined analogously to the corresponding definitions of 7V-and iV + -ranks. 



Lemma 3.1 shows that an inequality will be valid for N(K) if it is valid for K n {x : X{ = 0} 
and K H {x : X{ = xq} for some i. In order to iterate Lemma [O], we first need the following 
lemma. It is stated in terms of the faces of Q, which can be obtained by intersecting Q with 
hyperplanes of the form {x : Xi = 0} or {x : x% = xq\. Similar insights for a procedure related 
to the N- procedure were discussed by Balas (1974). 

Lemma 3.2 Let F be any face of Q. Then 

N (K n F) = N(K) n F. 

Similarly for N + and Nq. 



Proof. "C" is clear from the definitions. For the converse, let x £ N(K) n F. This means that 
there exists a matrix Y G M(K) with Yeo = x. Since Yei G K C Q and Y(e$ — ej) £ K Q Q and 
their sum + F(eo — e%) = Yeo belongs to the face F of Q, we have that Yei and Y(eo — ei) 
must belong to F, by definition of a face. Thus, Yej £ K C\ F and F(eo — e^) G n F for all i 
implying that Y G M(K n F) and x G iV(F n F). The proof for N + is identical. ■ 



Iterating Lemma 3.2, we get: 



Corollary 3.3 Let F be any face of Q. Then, for any r, 

N r (K n F) = N r (K)nF. 

Similarly for N + and Nq . 
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Repeatedly using Lemma |3.l| and Lemma ^2 (or Corollary |3.3j ), we can derive a condition 



that an inequality be valid for N r (K). This, in particular, proves Theorem 2.3 



Theorem 3.4 N r + (K) C N r {K) C N^{K) C Nq(K) where 

Nq(K) = P| (if n {re : = /or z G Jo and Xj = /or i € Ji 

{JC{i,...,d}:|j|=r}{(j 0( j 1 ) partitions of J} 

We should point out that even though iV (if) = JVo(if) and Nf(K) = N d (K) = N${K) = 
Nq(K), Nq(K) is not necessarily equal to Nq(K), if 2 < r < (d — 1). For example, for 
if = { x g Q : xi + x 2 + x 3 < 1.5x }, one can show that (1, 0.5, 0.5, 0.5) G {N%{K) \ N%(K)). 

For N+(K), Lovasz and Schrijver (1991) give a different condition for the validity of an 
inequality. In the statement of the next lemma, the assumption that a > is without loss of 
generality (by flipping coordinates if necessary, as shown in Proposition |2.2|). 



Lemma 3.5 (Lovasz and Schrijver (1991)) Let a > 0. Then a T x < axo is valid for 
(K n {x : Xi = xq}) for all i such that a\ > 0, implies a T x < oxq is valid for N+(K). 

As mentioned previously, the result that clique, odd hole, odd antihole, odd wheel inequal- 
ities for the stable set problem have iV + -rank 1 follows from the above lemma. For the stable 
set problem (as for many combinatorial optimization problems), there exists several important 
constructions to derive facet-defining valid inequalities from other facet-defining inequalities. 
The simplest is cloning a clique at a vertex v, which consists of replacing the vertex by a clique, 
replacing all the edges incident to v by corresponding edges incident to all clique vertices and 
substituting in the inequality the variable for v by the sum of the variables of the clique vertices. 
It can easily be shown that the resulting inequality is valid and facet-defining if the original 
inequality was a non-trivial (i.e. different from the nonnegativity constraints) facet-defining in- 
equality. In general, it is not clear how cloning influences the iV + -rank of an inequality. However, 
if we perform cloning at the center vertex of an odd wheel inequality, Lemma |3.5| implies that 
the iV + -rank still remains equal to 1. If we perform cloning at one or several vertices of an odd 



wheel, odd hole or odd antihole inequality, Lemma 3.5 implies that the iV_|_-rank is at most 2. 
Indeed, if we fix any variable (of the corresponding subgraph) to 1, the resulting inequality can 
be seen to be a linear combination of clique inequalities and hence valid for N + {FRAC). 



Lemma 3Jj can be extended to derive conditions under which the iV + -rank of an inequality 
is at most r. 



Theorem 3.6 Let a > and let I + = {i : en > 0}. // a T x < uxq is valid for 
(K H {x : Xi = xq, for all i £ I}) for all sets I C J + satisfying either of the following two con- 
ditions 
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1. \I\ = r, 

|/| < (r — 1) and Siej a« > a, 

f/ien a T x < aa^o valid for NL(K). 

Observe, however, that the result mentioned previously regarding cloning does not follow 



from Theorem 3.6 



Proof. We proceed by induction on r. For r = 1, the result is Lemma 3.5 



Assume now that r > 1, that the theorem was proved for (r— 1) (and for any inequality and for 
any convex set K), and that the hypothesis is satisfied for the inequality a T x < axo and r. From 
Corollary |3.3| and Lemma [D], we know that a T x < ax is valid for N r + (K) = N + (N r i T 1 (K)) 
if it is valid for N^T l (K) n {x : Xi = x } = N r + ~ l (K n {x : x { = x }) for all i G I+. This is 
equivalent to showing that a T x — aiXi < (a — a,i)xo is valid for iV^ -1 (K n {x : x% = xq}). 

Now there are two cases. If a — a, < then condition || implies that KPi{x : X{ = xq} = and 
thus any inequality is valid for iV^~ (K D {x : Xi = xq}) = 0. On the other hand, if a — ai > 0, 
we can use induction to prove the result. Indeed, conditions |l| and ^ for inequality a T x < oxq and 
r imply that conditions |l] and || are satisfied for the inequality a T x — aiXi < (a — ai)xo for r — 1. 
Thus, by the inductive hypothesis, aTx — aiXi < (a — Oj)xo is valid for iV^ -1 (K n {x : X{ = xq}), 
proving the inductive statement. ■ 



For the stable set problem, the above theorem implies that the iV + -rank of a graph is at 
most its stability number a(G), the cardinality of the largest stable set in G; this was proved 
in Corollary 2.19 of Lovasz and Schrijver (1991). More generally, if we consider a polytope P 
for which Pj is only described by inequalities of the form a T x < axo with a > (i.e. it is lower 
comprehensive, see Section then its 7V_|_-rank is upper bounded by the maximum number of 
variables that can be set to 1 in P to obtain a unique integral point of Pj (in which the other 
variables are thus set to 0). Similar, more complex, statements can be made if the polytope is 
not lower comprehensive. 



3.1 Example 1: Matching polytope 

Consider the complete undirected graph on the vertex set V; let E denote its edge set. Let 

P := {x e R E : x(5(v)) < 1,V« G V, < x < e}. 

In the above, 5(v) is the set of edges in E that are incident on v; for S C E, x(S) represents 
Xj. For S C V, let E(S) refer to the set of edges with both endpoints in S. Then the 

ies 
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matching polytope for the complete graph is 

Pi := conv{Pn{0, 1} E } . 

Edmonds (1965) proved that 

Pi = jx e P : x{E(S)) < ^ ~ 1 for all S C V such that |5| is odd } . 
The above inequalities are known as the blossom inequalities. 

Theorem 3.7 (Stephen and Tuncel (1999)) The N + -rank of the inequality 

\S\-1 



x(E(S))< 



with respect to P is ^ 1 , 



The fact that the A r + -rank is at most ^ 1 also follows directly from Theorem |3.6| , Observe 
that since d is |V|(|V| — l)/2, we derive that the iV_|_-rank of P is equal to (y/l + 8d — l)/4 if 
|V| odd and, (y/T+M- 3)/4 if |V| even. 



From Theorem 3.7, the TV-rank of the blossom inequality on S is at least J — k — . Furthermore 



using Theorem 3A with J being the complement of a complete bipartite graph on ^ 1 and ^-jp- 



vertices on each side, we derive that the iVo-rank of a blossom inequality is equal to 4 — . 
This uses the fact that P is an integral polytope if and only if the underlying graph is bipartite. 



(\S\ — 1) 

Thus, the iV-rank is at most 4 ■ These bounds are to be compared with those derived from 
Corollary 2.8 of Lovasz and Schrijver (1991) (since a matching in a graph can be viewed as a 
stable set in its line graph). Their results imply a lower bound of (\S\ — 2) and an upper bound 
Of 1)2-1. 



3.2 Example 2 



Consider 
K : = 
Then 



, <E f':"" J : x(S) < ^x , for all S C {1, 2, . . . , d} such that \S\ = ^ + 1, < x < x 



x 



Kl = {{ X x) eRd+1: f^ Xi ~ 2^°' °^^oej 



Theorem 3A implies that the iV-rank of Xa=i x i — 2 X ° ^ s a ^ mos t — 2), while Theorem | 



implies that the iY + -rank is at most |. These bounds are actually attained and this is discussed 



in Section [4^. We also show in that section that the positive semidefiniteness constraint does 
not help for many iterations. 
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4 Lower bounds on the N- or 7V + -rank 

In this section, we provide lower bounds on the N- and 7V+-rank. We also show a situation in 
which the positive semidefiniteness constraints do not help at all and both the iV-rank and the 
iV + -rank of a polytope is d. 

We first provide a way to derive points in N + {P) in certain cases. For x G M. d define 

Cj) . = { x i if * ^ i; 
4 " \ if * = j. 

So, x^ = x — x^e,-. Throughout this section, let K = | ^ ^ ^ : x G P, A > 1. 

Theorem 4.1 Let i£P suc/i i/tai 

x^ and (x^ + ej) G P, for all j such that < Xj < 1. 

JTten x G iV+(P). 

Simply stated, this result says that if we can replace any coordinate of x (strictly between and 
1) by and 1 and remain in P then x 6 N + (P). 



Proof. We define 



Y(x) := ( ^ ) (l,x T ) + Diag 



/ \ 

xi — xf 

X 2 - x| 



By definition, Y(x) G y(x)e = diag(T(x)) = G K. Moreover, 



1 



therefore, Y(x)ej G .K" for all j G {1, 2, . . . , d}. Similarly, 

1 

x 

therefore, F(x)(eo — ej) G K for all j G {1, 2, . . . , d}. Finally, since 

/ \ 



Y(x)(e - ej) = (1 - xj) ( _^ ) , for all j G {1, 2, . . . , d}; 



Diag 



xi — x 



X 2 - X2 

\x d -x 2 d J 



y and ( | ) ( l..r' j >; 



EFFICIENCY OF LIFTING PROCEDURES 11 



for all < x < e, we have Y(x) y 0. Therefore, Y{x) G M + {P) and x G N + {P) as desired. 



Figure 2: Convex set satisfying the condition of Corollary 4.2 . 
As a corollary, we derive the following (see Figure for an illustration). 



Corollary 4.2 Let P be such that (Pn{x : Xj = 0})+ej = Pn{x : Xj = 1} for all j G {!,■■■ ,d}. 
Then 

N + (P) = N(P) = N (P) = p| {x:x (j) eP}. 



Proof. Let C = f]je{i, -,d}{ x '■ xif) e p }- B y Lemma 0, we know that N+(P) C iV(P) C 
^Vo(-P) C C. On the other'hand, Theorem |4~T| shows that C C N + (P). U 



In the proof of Theorem 4.1, we constructed aFe M + (P) such that a certain x G P would 
also be in N + (P). The idea of the proof suggests a stronger technique to achieve such a goal. 
We define 

/ ° B \ 



Y(x) :-- 



1, x T ) + Diag 



Xl — Xi 



V. 



+ 



T 
P(x) / ' 



where B(x) G S , diag(P) = 0. Then clearly we have G Y(x)eo = diag(y(x)). 

Moreover, using the Schur complement of (Y(x))oo in Y(x), we have 

(xi — x\ ' 
■■ I > (J. 
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The latter can be assured in many simple ways, for example by diagonal dominance: It suffices 



to choose Bij such that 




# of nonzeros in column i # of nonzeros in column j I 

The entries of such a B{x) will be further restricted by the condition Y{x)ei E K for every 
i E {1, 2, . . . , d} and Y(x)(eo — ej) E K for every i £ {1,2, ... , d}. If this condition is verified for 
some B(x) then the above argument would imply x E N + (P). In the case of Theorem |4.1| , we 
utilized diagonal dominance; because of the special structure of P, we could choose B[x) := 
and satisfy all the conditions for x E N + (P). 

4.1 Example 3: Infeasibility detection 

We now give an example where both N and N + require d iterations, showing that Theorem 
cannot be improved. This result was independently obtained by Cook and Dash (1999) who 
also show additional results regarding the rank of inequalities. Previously, the worse example 
known in terms the number of repeated N + iterations needed to obtain Kj was the matching 
polytope results of Stephen and Tungel (1999) where the iY + -rank was of the order of yfd. 



Let 

P{p) := I x E ~ " 



r 



Theorem 4.3 For < p < d, N + (P(p)) D P(p—1). Furthermore, P(l) / while Pi(d— 1) 
Thus, the N + procedure requires d iterations to prove Pi(d — 1) = 0. 



Proof. Follows from Corollary |4.2j. (In fact this corollary characterizes precisely N+(P(p)).) 



One interesting feature of the example above is that P(d— 1) can be described by 2 d inequal- 
ities, contains no integral point, but no inequality can be removed without creating an integral 
point. This is actually an extreme situation in this regard as shown by the following result of 
Doignon (1973). Suppose we are given a set of m linear inequalities 



a 



fx < bi, for all i G J, 



where x G M. d and \J\ > 2 d . A theorem of Doignon (1973) implies that if this system does 
not contain any integer points then there is a subsystem (of this system) with at most 2 d 
inequalities which does not have an integer solution. Doignon's Theorem is an integer analog of 
Helly's Theorem. 
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4.2 Example 2, continued 



In Section 3.2, we have shown that the iV-rank and the iV_i_-rank of 



K := <[ J G IT +i : a;(5) < ~z , for all S C {1, 2, . . . , d} such that \S\ = - + 1, < x < x 
are at most (<i — 2) and (f/2, respectively. Here we claim that these bounds are attained. 

Theorem 4.4 The N-rank of Yli=i x i — i relative to K is (d — 2). The N + -rank of the same 
inequality relative to K is ^. 

oreover, for r < ^ — + 3/2, i/ie optimum values of 

max{e T x : x £ N r (K)} and max{e T rc : x G N+(K)} 

are the same. 

Our proof of the first statement of the theorem, saying that the iV-rank is (d — 2) is lengthy 
and is not included here. The proof of the remainder of the theorem appears partly in this 
section and partly in the Appendix. The theorem indicates that the positive semidefiniteness 
constraint does not help for (d/2 — o(d)) iterations. 



Unfortunately, neither Theorem |4.1| nor Corollary [4.2| is useful here. Instead, exploiting 
the symmetry (and convexity of N{K) and N + {K)), we will only consider points in N r {K) or 
N+(K) such that Xi takes only three possible values, 0, 1 and a constant a. Letting no denote 
the number of Xi set to and letting n\ denote the number of Xi set to 1, we define c(r, no,n\) 
to be the largest common value a of the remaining (d — uq — n\) coordinates of x such that 
x G N r (K). We define c+(r, nQ,n\) similarly with respect to N!L(K). 



By symmetry, such a point x belongs to N r {K) (resp. to N^K)) if there exists a symmetric 
matrix Y G M(K) (resp. Y G M + {K)) of the form 

/ 1 e T ae T \ 

e ee T aee T 
Y{n , ni ;a,l3) := 

\ae aee T (a — (3)1 + f3ee T ) 

for some value (5; here the columns of Y are partitioned in the way that the first column 
corresponds to the homogenizing variable xq, the next n\ columns correspond to those Xj that 
are set to one, the next no columns correspond to those Xj set to zero and the remaining 
{d — uq — n\) columns correspond to the remaining x^'s (which are set to a). 

For r = and n\ < d/2, we see by plugging x into the description of K that 



d/2~n 



i- if n < d/2 -1, 



c(0, n , ni) = c+(0, n , m) = { d / 2 + 1 -^ 

1 otherwise. 
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For r > 0, the condition that Y G M r (K) is equivalent to ^ < c(r — 1, no, ni + 1) (correspond- 
ing to Ye; G M r_1 (iT)) and £=| < c(r- 1, n + 1, ni) (corresponding to Y(e - e;) G M r_1 (K)). 
Eliminating /?, we derive: 

c(r - 1, n + l,ni) 

c(r,n ,ni) - 



1 — c(r — 1, no, ni + 1) + c(r - 1, n + 1, ni) 



The condition that Y y reduces to (by taking a Schur complement) (a — (3)1 + {f3 — a 2 )ee T y 
(where the matrices have size (d — no — n\) x (d — no — m), or a — /3 > and a — /3 + (d — no — 
n i)(/5 ~~ a2 ) > 0. This can be seen to imply that 

, , c + (r - l,n + l,ni) 

c+(r, no, ni j = mm 



1 - c+(r - l,n ,ni + 1) + c+(r - l,n + l,m) 
(d — no — ni — l)c + (r - l,n ,ni + 1) + I s 



d — no — ni 

Observe that the iV-rank (resp. the A r + -rank) of K is the smallest integer r such that 
c(r, 0,0) = \ (resp. c+(r, 0,0) = |)- Theorem [O] hence follows from the following proposi- 
tion. 



Proposition 4.5 



1. c(d- 3, 0,0) 



I + 53^6 ^ * s even 



I 2 + Tod=20 i f d is odd 

2. c+(d/2 - 1,0,0) > 0.5, 

3. For any r, no, rii such that r+no+ni < d/2— Vd+3/2, we have c(r, no, ni) = c + (r, no, ni). 

The proof of [l] is obtained by solving explicitly the recurrence for c; the details however, are 
omitted. The proof of the rest of the proposition is given in the Appendix. 



Theorem A.. 3 in the Appendix actually illustrates a peculiar behavior of the N+ operator 
(as well as the N operator) on this example. In cutting plane procedures, it is usual that 
the improvement due to the addition of a cutting plane (or a batch of them) decreases as the 



algorithm progresses. However, Theorem [A.3| shows that 

max{e T 3; : x G N r + (K)} = dc + (r,0,0) > d (l - — 

Hence, as illustrated on Figure [| for d = 500, the improvement in objective function value is neg- 
ligible for many iterations and only towards the end increases considerably. We should point out, 
however, that the procedures N and iV_|_ are such that the number of "important" inequalities 
generated in each iteration could potentially increase tremendously in later iterations. 
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Figure 3: Plot of c+(r, 0, 0) for d = 500 as a function of r. 

5 Additional properties 

A nonempty convex set P C ]R^_ is called lower comprehensive if for every i£P, every y G 
such that y < x is also in P. 

Definition 5.1 Let v G {0,1}^. .A convex set P C [0, l] d is said to 6e a convex corner with 
respect to v if there exists a linear transformation A of {0, l} d onto itself such that Av = and 
AP is lower comprehensive. 



Theorem 5.1 If P is a convex corner with respect to v G {0, l} d then so are N(P) and N+(P). 



Proof. By Proposition |2.2| and the definitions, it suffices to prove that if P is lower comprehen- 
sive then so are N{P) and N + {P). Let P be lower comprehensive and x £ N(P). It suffices to 
show that (x — Xjej) G N(P) for every j such that Xj > 0. Without loss of generality suppose 

j = 1 and xj > 0. Then there exists Y G M(P) such that Yeo = I I . Let 



Yi 



Yij if % ^ 1 or j / 1; 
otherwise. 
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Then using the fact that P is lower comprehensive, it is easy to see that Y 6 M{P). Since the 
above argument applies to every j such that Xj > 0, we proved that N(P) is lower comprehensive. 

We can prove that N + (P) is lower comprehensive by a very similar argument. We only have 
to note that if Y £ M + {P) then the corresponding Y constructed as above will be positive 
semidefinite (in addition to satisfying Yej £ K for every j £ {0, 1,2,..., d} and Y(eo — ej) £ K 
for every j £ {1,2, ... ,d}) since every principal minor of Y is a principal minor of Y and Y is 
positive semidefinite. ■ 

A similar fact, in a less general form, was observed independently by Cook and Dash (1999). 

6 General conditions on the strength of the semidefinite con- 



In this section, we derive general conditions under which the positive semidefiniteness constraint 
is not useful. This can be expressed in several ways as 



T{K) := cone { uv T + vu T : u £ Q* ,v £ K* } = cone {uv T + vu T : u £ ext(Q*), v £ ext(K*)} . 



straint 



• M(K) = M + (K), or as 

• N(K) = N+(K) or even as 

• max{c T x : x £ N(K)} = max{c T x : x £ N + (K)} for some given c. 



First, we rewrite condition (ii) of Definition 2.1. Since Y is symmetric, 




where 



Let's define 




Note that the cone (more specifically, the subspace in this case) dual to D in the space 

S d+i 

is 

the orthogonal complement of D. 




where E^ := + ejef. We have 
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Theorem 6.1 

M + {K) = M(K) if and only if T{K) + D L D S d+1 . 

Proof. By definition of the sets M(K), M + (K), we have 

M(K) = M+(K) ^ [T(K)\* HD= [T(K)\* nDn S^ +1 . 
Since the inclusion [T{K)\* n D D [T(iT)]* nDn 5^f +1 is clear, we have 

M{K) = M + {K) [T(K)]* ROC 

Noting that 

[T(iT)]* nDC {{T(K)\* n £))* D 

(we used the fact that S d + +1 is self dual under the trace inner-product, in the space S d+1 ) and 
that 

{[T(K)]* D D)* =T{K) + D*, 

we conclude 

M(K) = M+(K) if and only if T(K) + D L D S d+1 . 



This theorem completely characterizes when M and M + differ or are equal. To make the 
condition more easily tractable, we can give a more explicit description of T(K) + D^. Define 

F(K) to be set of all v = ^ e M. d+1 such that — v T x < vq is a facet of P (or, more generally, 

for non-polyhedral convex sets, F(K) describes a set of valid inequalities exactly characterizing 
P). Note that F(K) can be taken as the set of extreme rays of K*. We arrive at the identity 



T(K) + D J 



cone{(eiV T + vef), i G {1, 2, . . . , d}, v G F(K); 



[(eo 
{En 



.t 



eiX +v(e -eiY\ , i G {1, 2, . . . , d}, v G 
i€{l,2,...,d},} 



where we have used the fact that £ 0i - #ii G ^(if) since e; G F(K). So, M+(if) = M(K) iff 
for every x 6 we can express xx T as an element of the above cone (T(K) + D- 1 ). 

Consider the clique on four vertices and the corresponding LP relaxation FRAC of the stable 
set problem (with the edge and nonnegativity constraints only). For this example, 



Y := 



! 
V? 



i i 

3 3 


1 

5 1 

o o \) 



3 



i 
\ 



€ [T(if )]* n D; 



EFFICIENCY OF LIFTING PROCEDURES 



18 



but clearly Y ^ A proof of this is provided by the incidence vector of the clique inequality 

on the four vertices: 

(e - e x - e 2 - e 3 - e 4 ) T Y(e - ei - e 2 - e 3 - e 4 ) = --. 

This means, for x := (eo — e\ — e 2 — e% — e^), xx T is not in the convex cone (T(K) + D^. 
Now, we relate these findings to N(K) and N+(K). 

Corollary 6.2 // (T{K) + D- 1 ) D S^ +1 , then N r + (K) = N r (K) for every r > 0. 



Proof. Trivial for r = 0. By Theorem 6.1, the assumption of the corollary implies N + (K) 



N(K). By Theorem |J, N(K) C K. Thus, 



Now, applying Theorem 3.1 recursively, we obtain the desired result. 



Now we look at the weaker condition that N(K) = N + (K). 

Theorem 6.3 N+(K) = N(K) if and only if for every s G R d+1 , 

Diag(s) G T(K) + D 1 - + S^_ +1 implies Diag(s) G T(K) + D" 1 . 

Before proceeding with the proof, observe that, for any convex cone /C C S d+1 , we have 

[diag(/C)f = |s G : fliaj(s) G /C*} . (2) 



Proof. As in the proof of Theorem 6.1, we obtain 

N(K) = N+(K) ^ diag ([T(K)]* n -D) C diag (jT(iT)]* nfln . 



Using equation (|2|) and the proof technique of Theorem S.l, we find 

N(K) = N+(K) if and only if 
for every s G K d+1 , Diag(s) G T(if) + + implies Diag(s) G T(K) + Z> 
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We should compare this result to Lemma 1.2 of Lovasz and Schrijver (1991). Note that our 
result is also based on cone duality, we also characterize the dual cones of N(K) and N + (K); 
but, we only work in the space of symmetric matrices instead of the larger space of all matrices. 
As a result, the dependence of the characterization on the skew symmetric matrices is eliminated 
and our description is more explicit. 

Our ideas in the geometric characterizations above are also applicable in comparing the 
weaker procedure No to N. Recall 

N (K):= P| {(Kn{x:x i = 0}) + (Kn{x:x i = x })}. 

i=l,---,d 

We define 

M (K) := jy G M (d+i)x(rf+i) . Ye = Y T e = ding(Y),u T Yv > 0,Vu G Q*,v G K*} , 

the main difference with M is that Y is not necessarily symmetric. As is mentioned by Lovasz 
and Schrijver (1991), we have 

N (K) = {Ye :Y £ M (K)} . 

We further define 

T (K) := cone {uv T :u£Q*,v£ K*} , and D := {y G R^ 1 )*^ 1 ) . Y e = Y T e = diag(y)} . 
Then 

Y G M (K) iff Y G {[T (K)Y D D ) , 
where [T {K)]* is the dual of T (K) in R^+^x^ 1 ) under the trace inner-product. 

Theorem 6.4 M (K) = M(K) iff (T (K) + Dq) D {(e ie j - e je [ ) : i, j G {1, 2, . . . , d}} . 

Proof. As we showed, Mq(K) = [Tq(K)]* D Dq and it is clear from the definitions that M(K) = 
[To(K)]* n D n S d+1 . Note that 

D^ = span {e^ef - e ef , e^ef - : « G {1, 2, . . . , d}} . 

Thus, 

±(e ef - e t el) G (t (K) + Dfj , Vz G {1, 2, . . . , d}. 

Let denote the subspace of (d + 1) x (d + 1) skew-symmetric matrices with real entries. 

Therefore, 

(t (K) + Dfj ^ iff (t (K) + D^-) D {(eiej - e,ej) : i, j G {1, 2, . . . , d}} . 
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Now, using elementary cone geometry on closed convex cones and the definitions, we have the 
following string of equivalences: 



T (K)+d£) 5 {(eiej-ejej) :i,j G {1,2,..., d}} iff [T (K) + D^j ^ S d+1 

iff [T Q (K)\* nD c s d+1 

iff Mo (if) = M(if). 



Corollary 6.5 // (T (K) + D±) D |( ei eJ - e.-ef ) : i, j G {1, 2, . . . , d}} i/ien JVJ(JfQ = N r {K) 
for every r > 0. 

Proof. Trivial for r = 0. By Theorem |6.4j , the assumption of the corollary implies Nq(K) = 
N(K). By Theorem |J, N(K) C if. Thus, 

T (N(K)) D T (K) D {{ ei ej - e 3 ej) : i, j G {1, 2, . . . , d}} . 

Now, applying Theorem recursively, we obtain the desired result. ■ 

Let G denote the complete graph on d vertices, and consider the LP relaxation FRAC of 
the stable set problem on G. For every i,j G {1,2, ... ,d} such that i ^ j, we have 

(eo — &i — ej ) G K* and clearly ej , G (if* n Q* ) . 

Thus, for every i,j G {1, 2, . . . , d} such that « 7^ j, we have 

ej(eo — ej — ej) T and e^ef G Tq(K), and (e^ef — e^eg ) G -Dq 1 - 

This implies, 

(r (if)+Z^) D {( e , e J- ei ef) g {1,2, . . . ,d}} . 

Therefore, the condition of Theorem |6.4| is satisfied and we have Nq(FRAC) = N r (FRAC) for 
every r > 0. 

As in Theorem ^T^, we obtain 

Corollary 6.6 Nq(K) = N(K) if and only if for every s G R d+1 , 

Diag(s) G T (K) + Dq + S d+1 implies Diag(s) G T Q (K) + . 
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Instead of comparing M{K) and M + (K), or N{K) and N + {K), we might ask when are the 
set of optimal solutions of both relaxations the same. This is precisely when 



[Awr + ( f ) 2 [JMJor + ( f ) 



where z* is the optimal value of max <j c T x : ( I € N(K) 



x 



Sometimes we are only interested in the bound provided by the relaxation. This is equivalent 
to finding the smallest z for which ^ Z G [iV(K)]* and the smallest z + for which ^ c ^ e 
[iV + (if)]*. 



References 



Balas E. (1974). Disjunctive programming: Properties of the convex hull of feasible points. 
Management Science Research Report 348 GSIA, Carnegie Mellon University, Pittsburgh, PA, 
USA. 

Balas E., S. Ceria and G. Cornuejols (1993). A lift-and-project cutting plane algorithm for 
mixed 0-1 programs. Math. Prog. 58 295-323. 

Cook, W., and S. Dash, private communication, 1999. 

Doignon, J.-P. (1973). Convexity in cristallographical lattices. Journal of Geometry 3 71-85. 

Edmonds, J. (1965). Maximum matching and a polyhedron with 0,1-vertices. Journal of Re- 
search of the National Bureau of Standards-B 69B 125-130. 

Grotschel, M., L. Lovasz and A. Schrijver (1981). The ellipsoid method and its consequences in 
combinatorial optimization. Combinatorica 1(2) 169-197. 

Lovasz, L. and A. Schrijver (1991). Cones of matrices and set-functions and 0-1 optimization. 
SIAM J. Optim. 1 166-190. 

Stephen, T. and L. Tungel (1999). On a representation of the matching polytope via semidefinite 
liftings, Math. Oper. Res. 24 1-7. 



EFFICIENCY OF LIFTING PROCEDURES 



22 



APPENDIX 

In this appendix, we prove Proposition 4.5 and derive additional properties of c and c+. We 
first start with a few preliminary lemmas. 



Lemma A.l Assuming < b < a < 1 and p > 0, we have 

a 7 

1 — b + a 

and 

. / a (p-l)6+l\ , 

a > mm , — ) > b. 

\ 1 — b + a p 



Proof. First, a > 1 _^ +a follows from the fact that a > and a > b, and this implies also that 

• / a (p- 1)6+1 \ 
a > mm ( , r, , ■ !£ — '- . 

VI— o+a ' p / 

On the other hand, we have that 1 _^ +a > b iff a > b — b 2 + ab iff (a — — b) > 0, which 



(p- 1)6+1 
P 

both terms in the minimum are greater than b, and the second part of each inequality follows. 



follows by assumption. Furthermore, — — ^ > b iff (p — l)b + 1 > pb iff 1 > b. As a result 



This implies the following interlacing property. 

Corollary A. 2 For any r > 1 and any no, ni < ^ — r, we Ziaue that 

c(r - 1, no, n\ + 1) < c(r, no, ni) < c(r — 1, n + 1, n\) 

and 

c + (r - l,n ,ni + 1) < c+(r,n ,ni) < c + (r - l,n + l,ni). 
Proof. For r = 1 and no, n\ < | — 1, we have that < f(r— 1, no, ni+1) < f(r—l, no+1, ni) < 1 



where / = c or / = c+ by (hi). Lemma A.l now implies the result for r = 1. 



Proceeding by induction on r and assuming true the result for r — 1, we derive that f(r — 
1, no, ni + 1) < /(r — 2, no + 1, n\ + 1) < /(r — 1, no + 1, ni), which implies the result for r by 
Lemma I 



We can now get a lower bound on the coefficients c and c_|_. 
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Theorem A. 3 For any r, no, n\ such that s = r + uq + n\ < d/2, we have that 

c(r,n ,nx) > c + (r,n ,ni) > c(0,0,s) = fj 2 S . 

a/2 + 1 — s 

In particular, c+(d/2 — 1,0,0) > 0.5. 

This shows that the A^+-rank of K is d/2. 

Proof. For s < d/2, we have 

e(r,n ,ni) > c + (r,n ,m) > c+(r,0,n + n\) > c + (0,0,r + n + rii) = c(0,0,s), 
where we have used Corollary [A.2j twice. 



Lemma A. 4 Let l>a>b>c>0be such that a — b < b — c. Then 

i a b 
a-b < < b- c. 

l-b+a l-c+b 



Proof. The first inequality is equivalent to 

\l-c + b J \l-b + a J 

This inequality is satisfied since < b < a and < — 1 < 1 _\ +a — 1 (because < a — b < 

b-c). 

For the second inequality, we have that 

( b _\_ f a _\ = (b- c )(l- c ) _ (a-b)(l-6) 
Vl-c + 6 / \l-b + a J l-c + b l-b + a 

Moreover, we know that 1 — c> 1 — 6 > and (6 — c)/(l — c + b) > (a — b)/(l — b + a) >0 since 
< a — b < b — c. Multiplying these two inequalities together, we get the desired inequality. 



This implies that the coefficients c(r, no, ni) also satisfy the following differential interlacing 
property. 

Corollary A. 5 For any r > 1, any < no < d/2 — r — 2, any 1 < n% < d/2 — r, we have that 

c(r — l,no + 2, ni — 1) — c(r — l,n + l,ni) < c(r, n + 1, n\ - 1) - c(r, n , ni) 

< c(r - l,n + l,ni) - c(r - l,n ,ni + 1). 
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Proof. For r = 1, 1 < ni < | — 1 and no < d/2 — 3, let a = c(r — l,no + 2, m — 1), 
6 = c(r — 1, no + 1, n\) and c = c(r — 1, no, ni + 1). Observe that o = 1 — rf / 2 +2-ni ' ^ = ^~ d/2+i-m 
and c = 1 — d j2_ ni , implying that a > b > c and a — b < b — c. Thus, Lemma A4 implies the 
result for r = 1. 

We now proceed by induction and assume the result true for r — 1 > 1. Defining a, b and c 



as above, we know from Corollary A. 2 that a > b > c and from the inductive hypothesis that 



a — b < c(r — 2, no + 2, ni) — c(r — 2, no + 1, n\ + 1) < b — c. Lemma A. 4 then implies the result 
for r. ■ 



Using Corollary [A.5| repeatedly, we derive the following corollary. 

Corollary A. 6 For any r > 1, no, n\ > suc/i t/iai s = r + no + ni < d/2, we have that 
c(r — 1, no + 1, ni) — c(r — 1, no, ni + 1) < c(0, 1, s — 1) — c(0, 0, s) 



(d/2 + l-s)(d/2 + 2-s) 



Proof. Using Corollary |A.5| , we derive 

c(r — 1, no + I, rix) — c(r— 1, no, n\ + 1) < c(r - 1, 1, n + ni) - c(r - 1, 0, n + n\ + 1) 

< c(0,l,s - 1) - c(0,0,s). 



Theorem A. 7 For any r, no, n\ > suc/i i/iai s = r + no + ni < d/2 — Va + 3/2, we have that 
c(r,n ,ni) = c+(r,n ,ni). 



Proof. The proof is by induction on r. The base case is obvious. Assume the result is true for 
r — 1. This implies that c(r — l,no + l,ni) = c + (r — l,no + l,ni) and c(r — l,no,ni + 1) = 
c + (r — l,no,ni + 1); we denote respectively by a and b these two quantities. The result would 
then follow if we can show that 

a „ - 1)6 + 1 



1 — b + a p 

where p = d—no—n\. This inequality is equivalent to pa < l — b+a+(p—l)b—(p—l)b 2 +(p—l)ab, 
or to (1— b){a— b)(p— 1) < 1—6. Since 6 < 1, we need to prove that a— b < = d _ no _ m _ 1 ■ This 



follows from Corollary A.£ since we have that a — b < — , — 7 < t~t=, h~i=, r < 

J 1 — - 1 (d/2+l-s){d/2+2-s) — (Vd-0.5)(v5+0.5) 



1 < 1 



d—1 — d— no— ni— 1' 



